Search CORE

27 research outputs found

Filtering and Polarity Detection for Reputation Management on Tweets

Author: Farkas Richárd
Hangya Viktor
Publication venue
Publication date: 01/01/2013
Field of study

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

Target-Level Sentiment Analysis on Various Genres

Author: Hangya Viktor
Publication venue
Publication date: 07/02/2020
Field of study

SZTE Doktori Értekezések Repozitórium (SZTE Repository of Dissertations)

Latent Syntactic Structure-Based Sentiment Analysis

Author: Farkas Richárd
Hangya Viktor
Szántó Zsolt
Publication venue
Publication date: 08/09/2017
Field of study

People share their opinions about things like products, movies and services using social media channels. The analysis of these textual contents for sentiments is a gold mine for marketing experts, thus automatic sentiment analysis is a popular area of applied artificial intelligence. We propose a latent syntactic structure-based approach for sentiment analysis which requires only sentence-level polarity labels for training. Our experiments on three domains (movie, IT products, restaurant) show that a sentiment analyzer that exploits syntactic parses and has access only to sentence-level polarity annotation for in-domain sentences can outperform state-of-the-art models that were trained on out-domain parse trees with sentiment annotation for each node of the trees. In practice, millions of sentence-level polarity annotations are usually available for a particular domain thus our approach is applicable for training a sentiment analyzer for a new domain while it can exploit the syntactic structure of sentences as well

Repository of the Academy's Library

Extending Multilingual Machine Translation through Imitation Learning

Author: Fraser Alexander
Hangya Viktor
Lai Wen
Publication venue
Publication date: 14/11/2023
Field of study

Despite the growing variety of languages supported by existing multilingual neural machine translation (MNMT) models, most of the world's languages are still being left behind. We aim to extend large-scale MNMT models to a new language, allowing for translation between the newly added and all of the already supported languages in a challenging scenario: using only a parallel corpus between the new language and English. Previous approaches, such as continued training on parallel data including the new language, suffer from catastrophic forgetting (i.e., performance on other languages is reduced). Our novel approach Imit-MNMT treats the task as an imitation learning process, which mimicks the behavior of an expert, a technique widely used in the computer vision area, but not well explored in NLP. More specifically, we construct a pseudo multi-parallel corpus of the new and the original languages by pivoting through English, and imitate the output distribution of the original MNMT model. Extensive experiments show that our approach significantly improves the translation performance between the new and the original languages, without severe catastrophic forgetting. We also demonstrate that our approach is capable of solving copy and off-target problems, which are two common issues existence in current large-scale MNMT models

arXiv.org e-Print Archive

Aspektusszintű annotáció és szentimentet módosító elemek egy magyar nyelvű szentimentkorpuszban

Author: Hangya Viktor
Szabó Martina Katalin
Vincze Veronika
Publication venue: Szegedi Tudományegyetem
Publication date: 01/01/2016
Field of study

Repository of the Academy's Library

Doménspecifikus polaritáslexikonok automatikus előállítása magyar nyelvre

Author: Farkas Richárd
Hangya Viktor
Publication venue
Publication date: 01/01/2015
Field of study

Napjainkban a közösségi média jelentős népszerűségre tett szert, szinte bármilyen témakörben nagy mennyiségű szöveg érhető el. Ennek köszönhetően nagy figyelmet kaptak a különböző véleménydetekciós módszerek, melyek feladata szövegek osztályozása azok tartalmának polaritása alapján. A feladat megoldása során segítséget nyújtanak az ún. polaritáslexikonok, melyek az egyes szavak polaritására nézve hordoznak információkat. Munkánkban bemutatunk különböző módszereket lexikonok előállítására, valamint azok kiegészítésére és adaptálására más doménekre. Vizsgálatainkat kifejezetten számítástechnikai eszközökkel kapcsolatos véleményeken és általános hírekből származó szövegeken végeztük el, melyekből kiderül, hogy az osztályozás pontosságára nézve a megfelelő lexikon kiválasztása meghatározó

University of Szeged

Combining Word Embeddings with Bilingual Orthography Embeddings for Bilingual Dictionary Induction

Author: Fraser Alexander
Hangya Viktor
Schütze Hinrich
Severini Silvia
Publication venue
Publication date: 01/01/2020
Field of study

Crossref

Open Access LMU

A Hungarian Sentiment Corpus Manually Annotated at Aspect Level

Author: Hangya Viktor
Simkó Katalin Ilona
Szabó Martina Katalin
Varga Viktor
Vincze Veronika
Publication venue: European Language Resources Association (ELRA)
Publication date: 01/01/2016
Field of study

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

Repository of the Academy's Library

Entitásorientált véleménydetekció webes híranyagokból

Author: Berend Gábor
Farkas Richárd
Hangya Viktor
Publication venue
Publication date: 01/01/2015
Field of study

Napjainkban a hírközlés jelentős hányada digitális formában történik, a híranyagokban említett entitásokra vonatkozó vélemények polaritásának automatikus meghatározása pedig komoly előnyökkel járhat. Éppen ezért munkánk során az OpinHuBank adatbázisban található entitásokra vonatkozó vélemények bekategorizálását tűztük ki feladatunkul. A javasolt megoldásunk többek között a szövegegységek dependenciaelemzésére is támaszkodva képes az entitások mondatbeli szerepének figyelembevételével pontosabb képet adni a rájuk vonatkozó véleményekről

SZTE Publicatio Repozitórium - SZTE - Repository of Publications

University of Szeged